Bias of Purine Stretches in Sequenced Chromosomes
Abstract
We examined more than 700 DNA sequences (full-length chromosomes and plasmids) for stretches of purines (R) or pyrimidines (Y) and for alternating YR stretches; such regions are likely to adopt structures that differ from the canonical B-form. Since one turn of the DNA helix is roughly 10 bp, we measured the fraction of each genome that contains purine (or pyrimidine) tracts of 10 bp or longer (hereafter referred to as 'purine tracts'), as well as stretches of alternating pyrimidines/purines ('pyr/pur tracts') of the same length. Using this criterion, a random sequence would be expected to contain 1.0% of purine tracts and also 1.0% of the alternating pyr/pur tracts. In the vast majority of cases, there are more purine tracts than would be expected from a random sequence, with an average of 3.5%, significantly larger than the expectation value. The fraction of the chromosomes containing pyr/pur tracts was slightly less than expected, with an average of 0.8%. One of the most surprising findings is a clear difference between prokaryotes and eukaryotes in the length distributions of the regions studied. Whereas short-range correlations can explain the length distributions in prokaryotes, eukaryotes have an abundance of long stretches of purines or alternating purine/pyrimidine tracts that cannot be explained in this way; these sequences are likely to play an important role in eukaryotic chromosome organisation.
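The tract measurement described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' code: the function names (`to_ry`, `run_fraction`, `expected_fraction`, `simulate`) are hypothetical, and it assumes that a "purine tract" means a run of 10 or more consecutive purines or 10 or more consecutive pyrimidines, that ambiguous bases break a run, and that the random baseline is an i.i.d. sequence with equal purine and pyrimidine probability. Under that idealised model the expected covered fraction has the closed form (L + 1) / 2^L, which gives 11/1024 (about 1.07%) for L = 10, close to the 1.0% quoted above; the null model actually used in the paper may differ.

```python
import random

PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CT")


def to_ry(seq):
    """Recode a DNA string as R (purine), Y (pyrimidine) or N (anything else)."""
    return "".join(
        "R" if b in PURINES else "Y" if b in PYRIMIDINES else "N"
        for b in seq.upper()
    )


def run_fraction(ry, min_len=10, alternating=False):
    """Fraction of positions lying in a qualifying run of at least min_len.

    alternating=False: runs of identical symbols (all R or all Y).
    alternating=True:  runs in which neighbours always differ (YRYR...).
    N symbols always break a run and are never counted as covered.
    """
    n = len(ry)
    covered = 0
    start = 0  # index where the current run began
    for i in range(1, n + 1):
        if i < n and "N" not in (ry[i - 1], ry[i]):
            same = ry[i] == ry[i - 1]
            if same != alternating:
                continue  # the run is still going
        length = i - start
        if length >= min_len and ry[start] != "N":
            covered += length
        start = i
    return covered / n if n else 0.0


def expected_fraction(min_len=10):
    """Expected covered fraction for an i.i.d. sequence with P(R) = P(Y) = 1/2.

    Assumption: this simple null model, not necessarily the paper's.
    """
    return (min_len + 1) / 2 ** min_len


def simulate(n=200_000, seed=0):
    """Measure both tract types on a uniformly random ACGT sequence."""
    rng = random.Random(seed)
    ry = to_ry("".join(rng.choice("ACGT") for _ in range(n)))
    return run_fraction(ry), run_fraction(ry, alternating=True)
```

Calling `simulate()` returns the measured fractions for homogeneous and alternating tracts on a random sequence; both should fall near `expected_fraction(10)`. Applying `run_fraction(to_ry(chromosome))` to a real sequence gives the quantity that the abstract reports as averaging 3.5% for purine tracts and 0.8% for pyr/pur tracts.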
Related articles
Analysis of 62 hybrid assembled human Y chromosomes exposes rapid structural changes and high rates of gene conversion
The human Y-chromosome does not recombine across its male-specific part and is therefore an excellent marker of human migrations. It also plays an important role in male fertility. However, its evolution is difficult to fully understand because of repetitive sequences, inverted repeats and the potentially large role of gene conversion. Here we perform an evolutionary analysis of 62 Y-chromosome...
Nucleotide bias causes a genomewide bias in the amino acid composition of proteins.
We analyzed the nucleotide contents of several completely sequenced genomes, and we show that nucleotide bias can have a dramatic effect on the amino acid composition of the encoded proteins. By surveying the genes in 21 completely sequenced eubacterial and archaeal genomes, along with the entire Saccharomyces cerevisiae genome and two Plasmodium falciparum chromosomes, we show that biased DNA ...
Population genetics of tandem repeats in centromeric heterochromatin: unequal crossing over and chromosomal divergence at the Responder locus of Drosophila melanogaster.
The Responder (Rsp) locus in Drosophila melanogaster is the target locus of segregation distortion and is known to be comprised of a tandem array of 120-bp repetitive sequences. In this study, we first determined the large scale molecular structure of the Rsp locus, which extends over a region of 600 kb on the standard sensitive (cn bw) chromosome. Within the region, small Rsp repeat arrays are...
NotI flanking sequences: a tool for gene discovery and verification of the human genome.
A set of 22 551 unique human NotI flanking sequences (16.2 Mb) was generated. More than 40% of the set had regions with significant similarity to known proteins and expressed sequences. The data demonstrate that regions flanking NotI sites are less likely to form nucleosomes efficiently and resemble promoter regions. The draft human genome sequence contained 55.7% of the NotI flanking sequences...
Visualization and Significance of DNA Structural Motifs in the Campylobacter jejuni Genome
The genome sequence of Campylobacter jejuni NCTC11168 was analyzed in terms of DNA structural properties (intrinsic curvature, base-stacking energy, and DNA flexibility) throughout the chromosome. In addition, we calculated the frequency of DNA repeats in C. jejuni and in chromosomes from 25 other species of the class Proteobacteria. Compared with the average, global repeats are underrepresente...
Journal title:
- Computers & chemistry
Volume 26, Issue 5
Publication date: 2002